Managing Considerable Distributed Resources for Demand Response: A Resource Selection Strategy Based on Contextual Bandit

نویسندگان

چکیده

The widespread adoption of distributed energy resources (DERs) leads to resource redundancy in grid operation and increases computation complexity, which underscores the need for effective management strategies. In this paper, we present a novel approach that decouples selection power dispatch tasks. task determines subset designated participate demand response service, while output selected candidates. A solution strategy based on contextual bandit with DQN structure is then proposed. Concretely, an agent action, solved environment. negative value operational cost used as feedback agent, links two tasks closed-loop manner. Moreover, cope uncertainty problem, distributionally robust optimization (DRO) applied reserve settlement satisfy reliability requirement against uncertainty. Numerical studies demonstrate DQN-based can achieve profit enhancement ranging from 0.35% 46.46% compared policy gradient under different quantities.

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Distributed Online Pricing Strategy for Demand Response Programs

We study a demand response problem from operator’s perspective with realistic settings, in which the operator faces uncertainty and limited communication. Specifically, the operator does not know the cost function of consumers and cannot have multiple rounds of information exchange with consumers. We formulate an optimization problem for the operator to minimize its operational cost considering...

متن کامل

Bidding Strategy in Demand Response Exchange Market

Demand response (DR) has many beneficiaries in the electricity market. There are independent players who are interested in DR, which include: transmission system owners, distributors, retailers, and aggregators. In this paper DR is introduced as a tradable commodity that can be exchanged between DR buyers and sellers in a pool-based market which is called demand response exchange (DRX). DRX ope...

متن کامل

A Contextual Bandit Approach for Stream-Based Active Learning

Contextual bandit algorithms – a class of multiarmed bandit algorithms that exploit the contextual information – have been shown to be effective in solving sequential decision making problems under uncertainty. A common assumption adopted in the literature is that the realized (ground truth) reward by taking the selected action is observed by the learner at no cost, which, however, is not reali...

متن کامل

A Distributed Scheme for Incentive-based Residential Demand Response

Demand response (DR) enables customers to adjust their electricity usage to balance supply and demand. Although commercial and industrial DR have been in use for decades, residential DR is still emerging. The goal of this paper is to design a DR scheme for residential customers. We consider households operating various appliances and propose a distributed DR scheme to manage the appliances to k...

متن کامل

Adaptive Representation Selection in Contextual Bandit with Unlabeled History

We consider an extension of the contextual bandit setting, motivated by several practical applications, where an unlabeled history of contexts can become available for pre-training before the online decisionmaking begins. We propose an approach for improving the performance of contextual bandit in such setting, via adaptive, dynamic representation learning, which combines offline pre-training o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Electronics

سال: 2023

ISSN: ['2079-9292']

DOI: https://doi.org/10.3390/electronics12132783